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Abstract. We prove that tiie complexity of the uniform first-order theory of ground tree rewrite graphs 
is in ATIME(2^ , 0{n)). Providing a matching lower bound, we show that there is some fixed 

f^ . ground tree rewrite graph whose first-order theory is hard for ATIME(2^ , poly(n)) with respect 

OQ ' to logspace reductions. Finally, we prove that there exists a fixed ground tree rewrite graph together 

with a single unary predicate in form of a regular tree language such that the resulting structure has a 
3 ■ non-elementary first-order theory. 
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1 Introduction 



\^ I Pushdown systems (PDS) are natural abstractions of sequential recursive programs. Moreover, the positive 

)^ • algorithmic properties of their transition graphs (pushdown graphs) make them attractive for the verification 

C/3 I of sequential recursive programs. In particular, the model-checking problem for MSO (monadic second- 

CJ . order logic) and hence for most temporal logics (e.g. LTL, CTL) is decidable over pushdown graphs and 

precise complexity results are known (cf. [4,22,31,32]). In addition to recursion, concurrency is indis- 

P\| ■ putably another feature that commonly arises in real-world programs. Multithreading is often introduced 

K^ I by design, e.g., to achieve speedup or to make programming more convenient. 

Ground tree rewrite systems (GTRS) (cf. [6, 11, 12, 19]), which were also studied under the name 
ground term rewrite systems in the rewriting community, are an extension of PDS with the ability to spawn 
^S ■ new subthreads that are hierarchically structured, which in turn may terminate and return some values to 

1^ I their parents. While the rules of a PDS rewrite a prefix of a given word, the rules of a GTRS rewrite a 

^—v ■ subtree of a given tree. 

The transition graphs of GTRS (ground tree rewrite graphs) do not share all the nice algorithmic proper- 
ties of pushdown graphs. For instance, the infinite grid is easily seen to be (embeddable into) a ground tree 
rewrite graph, which implies that MSO is undecidable over GTRS. In fact, most linear-time and branching- 
time temporal logics such as LTL and CTL have undecidable model checking problems over GTRS (cf . [19, 
28]). On the positive side, reachability, recurrent reachability, and fair termination are decidable (cf. [6, 10, 



t^ . 19,27]). Moreover, first-order logic (FO) and first-order logic with reachability predicates are decidable 

[12]. This implies that model-checking of the CTL-fragment EF is decidable for GTRS; the precise com- 
plexity was recently clarified in [14]. 

While model-checking FO with reachability predicates is clearly non-elementary over GTRS (this holds 
already for the infinite binary tree, which is a pushdown graph [26]), the precise complexity of model- 
checking FO over GTRS is open, although the problem is known to be decidable since more than 20 years 
[12]. The main contribution of this paper solves this problem. We prove that (i) the first-order theory of 
every ground tree rewrite graph belongs to the complexity class ATIME(2^ , 0{n)) (doubly exponen- 
tial alternating time, where the number of alternations is bounded linearly) and (ii) that there exists a fixed 
ground tree rewrite graph with an ATIME(2^ , poly(n))-complete first-order theory. The upper bound 
of ATI ME (2^''°' " , 0{n)) even holds uniformly, which means that the GTRS may be part of the input. The 
complexity class ATIME(2^ , poly(n)) appears also in other contexts. For instance, Presburger Arith- 
metic (the first-order theory of (N, +)) is known to be complete for ATIME(2^ , poly(n)) [1], see [9] 
for similar results. 

The upper bound of ATIME(2^ , 0{n)) is shown by the method of Ferrante and Rackoff [13]. 
Basically, the idea is to show the existence of a winning strategy of the duplicator in an Ehrenfeucht- 
Fraisse game, where the duplicator chooses "small" (w.r.t. to a predefined norm) elements. This method is 
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one of the main tools for proving upper bounds for FO-theories. We divide the upper bound proof into two 
steps. In a first step, we will reduce the FO-theory for a ground tree rewrite graph to the FO-theory for a 
very simple word rewrite graph, where all word rewrite rules replace one symbol by another symbol. The 
alphabet will consist of all trees, whose size is bounded by a singly exponential function in the input size 
(hence, the alphabet size is doubly exponential in the input size; this is the reason for the doubly exponential 
time bound). Basically, we obtain a word over this alphabet from a tree t by cutting off some upward-closed 
set C in the tree and taking the resulting sequence of trees. Intuitively, the set C consists of all nodes u of 
t such that the subtree rooted in u is "large". Here, "large" has to be replaced by a concrete value jti e N 
such that a sequence of n rewrite steps applied to a tree t cannot touch a node from the upward-closed set 
C. Clearly, m depends on n. In our context, n will be exponential in the input size and so will m. In a 
second step, we provide an upper bound for the FO-theory of a word rewrite graph of the above form. 

For the lower bound, we prove in a first step hardness for 2N EXP (doubly exponential nondeterministic 
time). This is achieved by an encoding of a (2^ x 2^ ) tiling problem. In this tiling problem, we are given 
a word w of length n over some fixed set of tiles, and it is asked, whether this word can be completed to 
a tiling of an array of size (2^ x 2^ ), where the word w is an initial part of the first row. There exists 
a fixed set of tiles, for which this problem is 2NEXP-complete. From this fixed set of tiles, we construct 
a fixed GTRS such that the following holds: From a given word w of length n over the tiles, one can 
construct (in logspace) a first-order formula that evaluates to true in our fixed ground tree rewrite graph if 
and only if the word w is a positive instance of the (2^ x 2^ ) tiling problem. Our construction is inspired 
by [14], where it is shown that the model-checking problem for a fragment of the logic EF (consisting of 
those EF-formulas, where on every path of the syntax tree at most one EF-operator occurs) over GTRS is 
complete for the class pi^^xp ^^ ^ second step, we show that our 2NEXP lower bound can easily be lifted 
to ATIME(2^ , poly(n)). For this, we have to consider an alternating version of the (2^ x 2^ ) tiling 
problem. 

We conclude the paper with a proof sketch for the following result: There exists a fixed ground tree 
rewrite graph together with a single unary predicate in form of a regular tree language such that the result- 
ing structure has a non-elementary first-order theory. This result is shown by a reduction from first-order 
satisfiability of finite binary words, which is non-elementary [26]. It should be noted that the first-order 
theory of a pushdown graph extended by regular unary predicates still has an elementary first-order theory 
(it is an automatic structure of bounded degree, hence its first-order theory belongs to 2EXPSPACE [18]). 

Related work. Other related problems over GTRS have been shown to be decidable: confluence [11,8], 
model checking fragments of LTL [29], and generalized recurrent reachability [28]. Several other exten- 
sions of PDS with multithreading capabilities have been considered in [5, 16,20,24]. Among these exten- 
sions, the class of process rewrite systems [20], which generalize both Petri nets and pushdown systems by 
providing hierarchical structures to threads, seem to have tight connections with GTRS. Recently, Mayr's 
process rewrite systems hierarchy [21] was refined via GTRS [15]. 

The class of ground tree rewrite graphs is contained in the class of tree automatic structures [2], whose 
FO-theories are (non-elementarily) decidable. In [18], it is shown that (i) for every tree automatic structure 
of bounded degree (which means that the Gaifman-graph has bounded degree) the FO-theory belongs 
to 3EXPTIME and that there is a fixed tree automatic structure of bounded degree with a 3EXPTIME- 
complete FO-theory. Note that in general, ground tree rewrite graphs are not of bounded degree. 

Further applications of the method of Ferrante and Rackoff in computer science can be found in [25] 
(for the theory of queues) and in [17] (for nested pushdown trees). 



2 Preliminaries 

By Z we denote the integers and by N = {0, 1, . . .} the set of non-negative integers. For ijjEl. we define 
the interval [i,j] = {i,i + l,...,j} and [j] = [0, .?]. 

For an alphabet A (possibly infinite), we denote with A+ = yl* \ {e} the set of all non-empty words 
over A. The length of the word w E A* is denoted by \w\. For _B C A, we denote with \w\b the number of 
occurrences of symbols from B in the word w. 



Let / : A — > _B be a mapping. For A' C A, we denote with / \A' : A' ^ B the restriction of / to 
A'. For sets A, B, C (where A and B may have a non-empty intersection) and two mappings f : A ^^ C 
and g : B ^ C , we say that / and g are compatible if f\(Ar\ B) = g\{Ar) B). Finally, for mappings 
f : A ^ C and g : B -^ C with ^ n i? = 0, we define / l+) g : ^ U B -^ C as the mapping with 
(/ W 9){o) = I {a) for a G A and (/ W g)(6) = g{b) for 6 e B. 

2.1 Complexity theory 

We will deal with alternating complexity classes, see [7,23] for more details. An alternating Turing- 
machine is a nondeterministic Turing-machine, where the set of states is partitioned into existential and 
universal states. A configuration with a universal (resp. existential) state is accepting if every (resp. some) 
successor configuration is accepting. An alternation in a computation of an alternating Turing-machine is 
a transition from a universal state to an existential state or vice versa. For functions t{n) and a{n) with 
a{n) < t(n) for all n > let ATIME(t(n), a(n)) denote the class of all problems that can be solved on an 
alternating Turing-machine in time t{n) with at most a{n) alternations. It is known that ATIME(i(n), t(n)) 
is contained in DSPACE(t(n)) if t(n) > n [7]. 

2.2 Labelled graphs 

A (directed) graph is a pair (V, -^), where y is a set of nodes and -^ C V x V is a binary relation. A 
labelled graph is a tuple ~ (V, S, {— >| a E £'}), where V is a set of nodes, 17 is a finite set of ac- 
tions, and — > is a binary relation on V for all a € E. We note that (labelled) graphs may have infinitely 
many nodes. For u,v G V, we define dtg,{u, v) as the length of a shortest undirected path between u and 
V in the graph (V^,Uae2; ~^)- ^°^ n G N and u G F let S'„(0,w) ~ {v E V \ de,{u,v) < n} be the 
sphere of radius n around u. Moreover, for ui, . . . ,Uk G V let S'„(©, ui, . . . , Uk) = Ui<i<fe Sn{&, Ui). 
We identify 5„(0, ui, . . . , Uk) with the substructure of © induced by the set S'„(©, ui, . . . , Uk), where in 
addition every Ui {1 < i < k) is added as a constant. For two labelled graphs ©i and ©2 with node 
set Vi and V2, respectively, and nodes ui, . . . ,Uk G Vi, vi, . . . ,Vk G V2, we will consider isomor- 
phisms / : S'„(©i, ui, . . . , Uk) — > Sn{&2,vi, . . . , Vk)- Such an isomorphism has to map Ui to Vi. We 
write S'„(©i,'Ui, ... ,Ufc) = S'„(©2,wi, ... ,Wfc) if there is an isomorphism / : 5„(©i,mi, . . . , Ufc) — > 

S'„(©2,Wl,---,Wfc)- 

Lemma 1. Let &i, (52 be labelled graphs with the same set of actions and node set Viand V2, respectively. 

Let u G Vi, V G V2, u G Vi, and w G V2 such that u ^ S'2n+i(©i, u) and v ^ 5'2n+i(©2 7 ^')- Finally, 
let f : S„{(&i,u) -^ 5„(©2,i') and f : S'„(©i,'u) — > S'„(©2,w) be isomorphisms. Then f & f : 
Sn{&i,u,u) — > S'„(©2,w,w) is an isomorphism as well. 

Proof. The lemma is obvious, once one realizes that the condition u ^ S'2„+i(©i,u) implies that the 
spheres S'„(©i,u) and Sn{&i,u) are disjoint and that there is no edge between the two spheres (and 
similarly for the spheres 5'n (©2 , w) and Sn{&2,v)). D 

Later, we have to lift a relation ^ on a set ^ to a larger set. We will denote this new relation again by 
-^. Two constructions will be needed. Assume that — > is a binary relation on a set A and let ACS. We lift 
— !> to the set i?+ of non-empty words over B as follows: For all u,v <E B^, we have u ^ v if and only if 
there are x, y G B* and a,b E A such that a ^^ b and u = xay, v — xby. Note that this implies \u\ = \v\. 
The second construction lifts -^ Q A x A from ^ to N x A as follows: For a,b E A and to, n G N let 
(to,, a) -^ {n, b) if and only if m — n and a ^ b. Note that (N x A, — >) consists of Ho many disjoint 
copies of {A, -^). Moreover, {{A U {$})^ \ {$}^, ^) (where $ ^ A is a new symbol) is isomorphic to 
(N X A+,^). 

For a labelled graph © == (V, S, {— >| a G S}), we define the labelled graph 

&+ = {V+,S,{^\aeS}). 
Note that by the above definition, — > is lifted to a relation on V^. 



2.3 First-order logic 

We will consider first-order logic with equality over labelled graphs. Thus, for a set S of actions, we 
have for each a E S a binary relation symbol a{x, y) in our language. The meaning of a(x, y) is of course 
X — > y. lfip{xi, . . . ,x„) is a first-order formula with free variables xi, . . . ,x„, © ~ (V, Z", {— >| a G U}) 
is a labelled graph, and wi , . . . , u„ G y, then we write © (= f{vi , . . . ,Vn) if ip evaluates to true in ©, when 
variable Xi is instantiated hy Vi (1 < i < n). The first-order theory of a labelled transition graph © is the 
set of all first-order sentences (i.e., first-order formulas without free variables) (p with © |= </?. In the final 
Section 5, we will consider the first-order theory of a labelled graph with an additional unary predicate. 
The quant fier rank of a first-order formula is the maximal number of nested quantifiers in (p. We will need 
the following well known lemma, which goes back to work of Fischer and Rabin. 

Lemma 2. Let S be a set of actions. Given a first-order formula 6{x, y) of quantifier rank qr(0) and a 
binary-coded integer j (let m be the number of 1-bits in the binary representation ofj), one can compute in 
logspace a first-order formula 6^(x,y) of quantifier rank 0(log{j) + qr (9)) and size 0{m-log{j)-\-m- \9\) 
such that for every labelled graph © = {V, S, {— J-j a G S}) and all nodes u,v G V we have: © \^ 
0^ (m, v) if and only if there is a (directed) path of length j from u to v in the graph {V,{{s,t) \ © (= 

e{s,t)}). 

Proof Before we define 6^ (x, y), let us inductively define for each A: G N a formula V'fc {x, y) such that for 
all u,v G V we have © ^ ipk{u, v) if and only if there is a (directed) path of length 2'' from u to w in the 
graph (F, {(s, i) I © h ^(s, *)})■ We define 

^o{x,y) = 0{x,y), and 

ipk{x,y) — 3z\/u,vl ((u — xAv^z)\/{u = zAv = y)) — ^ ipk~i{u,v) 1 for fc > 1. 

Note that the size of tpk{x,y) is 0(fc + |6'|) and the quantifier rank is 3k + qr(6). 

Let t/ C N be the set of all positions of the binary representation of j whose bit is set to 1, i.e. 
j — X^ieLf 2^ Let m = \U\ and let hi, . . . , hm be some enumeration of U. We can now define 6^ {x, y) as 

9^{x,y) = 3xi,. ..,Xm+lixi = X A Xm+l = y A /\ Xph,{xi,Xi+i)j. 

From the binary representation ofj, we can easily compute 6^{x,y). Moreover, the size of 0^{x,y) is 
bounded by 0{m ■ log(j) + m ■ \0\) and the quantifier rank is bounded by 0(log(j) + qr(6')). D 

One of most successful techniques for proving upper bounds for the complexity of first-order theories 
is the method of Ferrante andRackoff [13]. We will apply this method in Section 3.2. Let us briefly explain 
the main result from [13].^ Let © be a labelled graph, and let V be the set of nodes of ©. Assume that for 
every node w G V we have a norm |ti| G N (in our application, V will be a set of words and the norm of a 
word will be its length). Let Vn = {v eV \\v\ < n}. Moreover, for fc, ^ > 0, let =k.i be an equivalence 
relation on the set V^ and let _ff : N^ — > N be a function such that the following properties hold for all 
fc,^G N,M,wG V^: 

- If u =kfi V, then u and v satisfy the same quantifier-free formulas in the structure ©. 

- If u =k,e V and £ > 0, then for all u G V there exists v G VH{k.i) with (u, u) =k+i,i-i (^7 ''^)- 

Then, for every quantifier-free formula i/'(xo, . . . , xg) and all quantifiers Qo, ■ ■ ■ ,Qe G {3, V} we have that 
® h Qoxo ■ ■ ■ QgXi : ip{xo, ...,Xi) if and only if 

© h i9oa;o e VH(o,i)QiXi e VH(i,e-i) ■ ■ ■ Qexi G VH(efi) ■ V'Ca^o, • • • , Xi). 



The actual statement in [13] is stronger, but for our purpose the weaker statement below is sufficient. 



2.4 Trees 

Let :< denote the prefix order on N*, i.e., x ^ y for x, y G N* if there is some z e N* such that y — xz. 
A set I? C N* is called prefix-closed if for all x,y E \f4*, x ^ y E D impHes x E D. A ranked alphabet 
is a collection of finite and pairwise disjoint alphabets A = (Ai)jg[j.] for some A; > such that Aq ^ 0. 
For simplicity we identify A with Uierfci ^j- ^ ranked tree (over the ranked alphabet A) is a mapping 
t : Dt -^ A, where Dt C [1, fc]* satisfies the following: 

- Dt is non-empty, finite and prefix-closed, 

- for each x E Dt with t{x) e Ai we have xl, . . . , xi e Dt and xj ^ Dt for each j > i. 

We say that Dt is the domain of t - we call its elements nodes. In case t(x) G A2 for some node x, then 
xl is the left child and x2 the right child of x. A leaf of t is a node x with t(x) G Aq. An internal node of 
t is a node, which is not a leaf. We also refer to e G -Dt as the root of t. By Trees^i we denote the set of 
all ranked trees over the ranked alphabet A. Define size(t) as the number of nodes in a tree t. It is easy to 
show that the number of all trees from Trees^ of size at most n is bounded by \A\". 

Let f be a ranked tree and let x be a node of i. For each x G [1, fc]* we define xDt = {xy G [1, fc]* | y G 
Dt} and x^^ Dt ~ {y E [1, fc]* \ xy E Dt}. By t^^ we denote the subtree oft with root x, i.e. the tree with 
domain Dti^, — x~^Dt defined as t^^{y) = t{xy). Let s, i G Trees^ and let x be a node of t. We define 
t[x/s\ to be the tree that is obtained by replacing i-l-^ intbys, more formally Dtj^^/s] = {Dt\xDti^)[JxD g 
with 

I s[z) it y = xz with z E Dg. 

For two ranked trees s and t, let difF(s, t) — \Ds\ Dt\. Thus difF(s, t) is the number of nodes that belong 
to the tree s but not to the tree t. Let C be a prefix-closed subset of Dt. We define the string of subtrees 
t \ C as follows: If C = 0, then t\C ^ t.lfC ^ 9, then i \ C = t-^^^ • • • t^^-, where wi, . . . , w„ is 
a list of all nodes from ((C • N) n Dt) \ C in lexicographic order. Intuitively, we remove from the tree t 
the prefix-closed subset C and list all remaining maximal subtrees. For n G N and a tree t we define the 
prefix-closed subset up(t, n) C Dt as 

up{t,n) = {v E Dt\ s\ze{t^^) > n}. 

Note that t \ up{t, n) is a list of all maximal subtrees of size at most n in t, listed in lexicographic order. 

A tree t E Trees^ is a chain if Dt ^ {e} and for every internal node u E Dt, there is at most one child 
ui of u such that ui is internal. Hence, a chain t has a unique maximal (with respect to the prefix relation) 
internal node max(f) G N*. 

Lemma 3. Let A be a ranked alphabet and let ranks = {m G N | tti > I, Am ^ 0}. Then, for all n > 1, 
the following are equivalent: 

(a) There is a chain t E Trees^ with exactly n leaves. 

(b) There is a tree t E Trees^i with exactly n leaves. 

(c) There exist numbers dm E N (for each m E Ar) such that n = 1 + X^meranks '^™ ' ('^ ^ -'^)- 

Proof. Implication (a) ^ (b) is trivial. Now, assume (b) and let t E Trees/i has exactly n leaves. We show 
(c) by induction on the size of t. We distinguish two cases. The case n = 1 is clear. Now, assume that t 
has n > 2 leaves. Then, there must exist an internal node u E Dt such that all children of u are leaves. 
Let a > 1 be the rank of the symbol t{u). By replacing u by a leaf (labelled with an arbitrary constant 
from Aq), we get a strictly smaller tree with n — (a — 1) many leaves (note that a = 1 is possible). Hence, 
there exist dm £ N i'm G ranks) such that n — (a — 1) = 1 + YlmeM '^"i ■ {m — 1). Thus, we have 

n = 1 + (da + 1) • (a - 1) + Emeranks\{a} dra ' {m - 1). 

Finally, for the implication (c) ^ (a), assume that n = 1 + X^meranks dm ■ {m — 1). Take a chain t 
that consists of X^meranks dm internal nodes, dm of which are labelled with a symbol of rank m. All other 
nodes are leaves. It is a simple observation that t has exactly n leaves. D 

The following lemma follows directly from Lemma 3. 



Lemma 4. Let Abe a ranked alphabet and let ranks = {m G N | ?n > 1, Am 7^ 0}- Then, for every tree 
t G TreeSyi and every prefix-closed subset C of Dt the following holds, where n is the length of the string 
t \ C: There exist numbers dm G N (for each m £ Ar) such that n = 1 + X^mPranks '^'" ' ("^ ~ -'^)- 

2.5 Ground tree rewrite graphs 

A ground tree rewrite system (GTRS) is tuple TZ — (A, S, R), where A is a ranked alphabet, S is finite set 
of actions, and TZ C Trees^ x S x Trees^ is a finite set of rewrite rules. A rule (s, a, s') is also written as 
s I — > s' . The ground tree rewrite graph defined by TZ is 

0(7^) ^ (TreesA, i:, {^| a e S}), 

where for each a E S,we have t — > t' if and only if there exist a rule (s 1 — > s') e R and x E Dt such 
that t^^ = s and t' = t[x/s']. The following two lemmas are obvious: 

Lemma 5. Let TZ ~ (A, S, R) be a GTRS and let r be the maximal size of a tree that appears in R. Let s 
and t be ranked trees such that ^0(7^) (s, t) < n. Then size(t) < size(s) + r ■ n. 

Lemma 6. Let TZ ~ (A, S, R) be a GTRS and let r be the maximal size of a tree that appears in R. Let s 
and t be ranked trees such that diff (s, t) > r ■ n. Then d^^-ji^ (s, t) > n. 

Lemma 7. Let TZ = (A, S, R) be a GTRS and let r be the maximal size of a tree that appears in R. Let t 
be a ranked tree, n G N, and let C C up(t, r ■ n) be prefix-closed. Then we have 

S„{&{TZ),t)9iSJ&{TZ)+,t\C). 

Proof. Let t\C — ti ■ ■ ■ tm- Hence, there is a tree s with m leaves such that t results from s by replacing 
the z* leaf of s by i^ (1 < i < m), let us write t = s[ti,t2, . ■ . , tm] for this. Recall that the subtree rooted 
in a node from C C up(t, r ■ n) has size strictly larger than r ■ n. Therefore, a node from C cannot be 
accessed by doing at most n rewrite steps. Hence, every tree t' G S'„(0(7?.), t) can be written (uniquely) as 
t' — s[t'i,t'2^ . . . , t'„]. Moreover, the mapping t' h-> t'it'2 ■ ■ -t'm defines an isomorphism from 5'„(©(7^), i) 
toS'„(©(7e)+,t\C). D 

Remark 8. Note that if the word w results from the string f \ C by permuting the trees in the string, then 

we still have 5„(©(7^),t) ^ 5„(0(7^) + , w). 

The main goal of this paper is to study the complexity of the following decision problem. 

(Uniform) first-order theory of Ground Tree Rewrite Graphs 

INPUT: A GTRS TZ = {A, S, R) and an FO-sentence ip of the signature of 0(7^). 

QUESTION: Does 0(7^) h'/'hold? 



3 An ATI M E(22'""'^"\ 0(n)) upper bound 

In this section we will prove the following result: 

Theorem 9. The uniform first-order theory of ground tree rewrite graphs is in ATIME(2^ , 0{n)). 

It suffices to prove Theorem 9 for the case that the underlying ranked alphabet A contains a symbol of 
rank at least two. A ground tree rewrite graph, where all symbols have rank at most 1 is in fact a suffix 
rewrite graph, i.e., pushdown graph. But every pushdown graph is first-order interpretable in a full |_r|-ary 
tree F* (with F finite), where the defining first-order formulas can be easily computed from the pushdown 
automaton (or the ground tree rewrite system). Finally, the first-order theory of a full tree F* (with \F\ > 2) 
is complete for the class ATIME(2'^("^ , 0{n)) (under log-lin reductions) [9, 30]. 



The proof of Theorem 9 will be divided into two steps. In a first step, we will reduce the FO-theory for 
a given ground tree rewrite graph to the FO-theory for a very simple word rewrite graph of the form 0+, 
where © is a finite labelled graph. Note that if V is the set of nodes of ©, then V^ is the set of nodes of 
25+ . Moreover, every edge in ©+ replaces a single symbol in a word by another symbol. In our reduction, 
the size of the set V will be doubly exponential in the input size (which is the size of the input formula plus 
the size of the input GTRS). In a second step, we will solve the FO-theory of a simple word structure ©"'" 
on an alternating Turing machine. More precisely, we will show the following result: 

Theorem 10. There exists an alternating Turing-machine M, which accepts all pairs (©, ip), where © is 
a finite labelled graph and ip is an FO-sentence over the signature ofQ5 with ©"^ [= ip. Moreover, M runs 
in time 0{r/ ■ |(p|), where n is the number of nodes of & and I is the quantifier rank of (p. Moreover, the 
number of alternations is bounded by 0{£). 

We prove Theorem 10 in Section 3.2. Together with our first reduction. Theorem 10 yields Theorem 9. 

3.1 Proof of Theorem 9 

In this section, we will prove Theorem 9. Let TZ — {A, S, R) be a GTRS over the ranked alphabet A and let 
r be the maximal size of a tree that appears in R. Let © = © (TZ) and let ip be an FO-sentence of quantifier 
rank £ + I over the signature of ©. We want to check, whether & \^ p. Define the sets 

ranks = {m e N | m > l,Am 7^ 0}, 

M = {1 + ^ dm • (m - 1) I dm G N for m G ranks}. 

rn G ra n ks 

help = max(ranks) > 2 denote the maximal rank of a symbol from A. We define a function int : M -^ 
N U {00} as follows: Let m G M. If Ai 7^ (i.e., there exists a unary symbol), then we set int(?7i) = 00. 
If Ai = 0, then let '\nt{m) be the maximal number of internal nodes in a tree t G Trees^i with exactly m 
leaves (this maximum exists; in fact int(m) < to — 1). The intuition behind setting int(TO,) = 00 in case 
Ai 7^ is that there exist arbitrarily large trees with m leaves. Also note that M = N \ {0} in case A2 7^ 0. 

Lemma 11. For every m E M we have int(?7i) > ^1-:^-. 

Proof. It suffices to show the lemma for the case Ai = 0. In this case, the lemma can be shown by induction 
on TO.. The case m =: 1 is clear. Let tti G M \ {1} and let t G TreesA be a tree with to. leaves and int (to) 
many internal nodes. Let u E Dthe an internal node such that all children of u are leaves. Let t{u) G Aq 
with q > 2. If we replace m by a leaf, we obtain a tree with int (to.) — 1 many internal nodes and m — q + 1 
many leaves. We must have int(?7i — g+l) = int(?7i) — 1 (if there would be a tree with tti — g + 1 leaves and 
more than int(TO) — 1 many internal nodes, then we would obtain a tree with m leaves and more than int(?TT.) 
many internal nodes by replacing an arbitrary leaf by a node with q children). Moreover, by induction (note 
that q > 2), we have int(TO -<? + !)> j^. Hence, we get int(TO) > ^ + 1 = "'p'+f'^ > ^ 
(since p > q). D 

Lemma 12. Assume that Ai — 0. For every m E M there exists a chain with to. leaves and int (to.) many 
internal nodes. 

Proof. Let to. G M. By definition, there exists a tree t G Trees/i with to. leaves and int(m) internal nodes. 
It is easy to restructure t into a chain so that the number of leaves and the number of internal nodes is not 
changed. D 

For < i < ^ let 

a{i) =£t-7-4^ • ((p-1)t-4^ + 1) +PT-4' <r2 ■p-2'^^^\ (1) 

Note that we have 

<y{i + 1) > <y{i) + p • r • 3 • 4* > a{i) + r • 3 • 4* (2) 



for all < i < ^. Let 

U = T[l,a{£) + r-p-4'^]. 

Moreover, for every < z < £ let 

U, = T[l,a{i)]CU, 

y, =T[l,r-4'] CU, 

Wi = {a{ui,. . . ,Uq) \ q>l,a e Aq,Ul,. . . ,Uq eVt}\Vi C U. 

Note that size(t) < r • p • 4* + 1 for all t G Wi. We consider the set C/ as a finite alphabet and the sets Ui, 
Vi, and Wi as subalphabets. Note that 

\U\ < |A|'^W+'--P-4'. (3) 

Define the language 

Z = {weU* \\w\e M} 

over the alphabet U. Note that Z = f7+ in case A2 ^ 0. On the set (N x Z) U C/ we define a labelled graph 
Si with label set R as follows: Take an action cr e Z". By our general lifting constructions from Section 2.2, 
the binary relation —> on Trees^ is implicitly lifted to a binary relation on TreesJ^ and N x Trees J^. Since 
(N X Z) n t/ = 0, — > can be viewed as a binary relation on (N x Z) U U; simply take the disjoint union 
of the relations on (N x Z) and U. Finally, we define the labelled graph 

ei = {{\hix z)uu, s, {^\(7 es}). (4) 

For a word w — U1U2 ■ ■ -Un <^U* with ui, . . . , m„ e [/ we define 



^1 = X!^'^^("')- 



We define the sets 



Zi = {we V*W^V* n Z \ \\w\\ + int(|w|) > cr(i)}, 
L, - (N X Z,) U t/,. 

Note that Zi = V*WiV* f) Z in case Ai 7^ (clearly, we set n + 00 = 00 for every number n). Assume 
that the first-order sentence tp is of the form QgXi ■ ■ ■ Qixi QqXq : ip, where Qo, ■ ■ ■ ,Qe G {V, 3} and ip 
is quantifier-free. For < z < ^ — 1 and elements s^+i , . . . , S£ G (N x Z) U C/ let us define the set 

Li{si+i,. . .,S£) = Lj U S3.^^{&l,Si+l,. . .,Si). 

We define a first-order sentence (pi (with quantifiers relativized to the sets Li{si+i, . . . , s^)) over the sig- 
nature of 6 1 as 

ipi ^ QiXi e LeQi^iXi^i e Li^i{xi) ■■■ Qqxq e Lo{xi,...,Xi) i-ip. (5) 

We want to show that & \= tpif and only if 61 ^ fi- For this, we need the following lemma 

Lemma 13. Assume that 

- o<i<e, 

- s = (si+1, . . . ,S£) e ((N X Z) U UY'"- with Sj e Lj U 5*3.43(61,5^+1, . . . ,Si)forallj e [i + l,i], 

- t ~ (ti+i, ■ . ■ ,ti) G Trees^"*, and 

- f : S4i+i{&i,s) — > 5*44+1(25, t) is an isomorphism such that f\S4i+i{&i, Sj) is the identity for all 
j G [i + 1,£] with tj G t/i+i or Sj G Ui+i. 

Then, the following holds: 



(a) For all U G Trees^i there exists Si G ii U 5*3.4; (61, s) and an isomorphism g : 5*44(61, Si^'s) — > 
5*4; (©, ti^t) such that f and g are compatible and g\S4i{&i, Sj) is the identity for all j £ [i, i] with 
tj G Ui or Sj G Ui. 

(b) For all Si E Li U S^.4i{&i,s) there exists ti G Trees^ and an isomorphism g : 54i(6i,Si,s) — > 
S'4i(©,ii, t) such that f and g are compatible and g\S4i{@i, Sj) is the identity for all j G [i,£] with 
tj G Ui or Sj G Ui. 

Proof. Let/ : 54i+i(6i,s) -^ 54i+i(©,f) be an isomorphism such that /|"54i+i (61, Sj) is the identity 
for alii + I < j < £ with tj G C/i+i or Sj G C/i+i- Let us first prove statement (a). Let ti G TreeSyi. We 
distinguish two cases: 

Case 1. ti G 53.4; (25, t). Note that this impHes that ti belongs to the range of the isomorphism / and that 

S^^{&,t^,t) CS'4.+i(©,i). 

Then, we set Si — f~^{ti) G 5*3. 44(61, s). We define g as the restriction of / to the set 544(61, Si,s) C 
5*44+1(61, s). Now, assume that ii G Ui, i.e., s\ze{ti) < CT(i). We have to show that /|'54i(6i, Si) is the 
identity. Let tj (i + I < j < £) such that d^ (ti, ij) < 3 • 4*. With Lemma 5 it follows 

. (2) 

size(ij) < s\ze(ti} + r • 3 • 4* < a{i) + r • 3 • 4* < cr(« + 1). 

Hence, tj G t/^+i and f\S4i+i{&i,Sj) is the identity. Since dQ^{si,Sj) — dig,{ti,tj) < 3 • 4*, we have 
544 (61 , Si) C 544+1 (61 , Sj). It follows that / IS^i (61 , Si) is the identity. If s^ G Ui, then we can argue 
analogously. 

Case 2. ti ^ 53.44(0,1). We will find Si G Li and an isomorphism /' : 544 (61, s^) -^ 544 (©,ti) such 
that Si ^ 53.44(61,5). Then, Lemma 1 implies that g = (/ ["544(61,5)) tiJ /' is an isomorphism from 
544(61,5^,5) to 544(©,ii,t), which is compatible with /. Moreover, we will show that if ti G Ui or 
Si G Ui, then /' is the identity. 

In order to find Si, let ti \ up{ti, r • 4*) = ui • ■ • Um. Recall that the latter string is the lexicographic 
order of all maximal subtrees of ti whose size is at most r • 4*. Hence, size(uj ) < r • 4* for each j. 

Case 2.L size(ii) > cr(i). We must have ti ^ ui, because otherwise size(ti) < r • 4* < (j{i), which is a 
contradiction. Therefore, there must exist 1 < j < m, a symbol a E A of rank q > 1, and a prefix-closed 
subset C C up{ti, r ■ 4*) such that a{uj, . . . , Uj+q-i) G Wi and 

tj \ C =: Ul • • •Mj_ia(Wj,. . . ,Uj+q^l)Uj+q ■ ■ ■ U-m- 

Let w — ti\C.'Ry Lemma 4, we have \w\ G M. By the definition of the mapping int, we have ||w|| + 
intduij) > size(ii) and hence ||w|| + int(|w|) > a{i) by assumption. Thus, we get w G Zi. Choose a 
number n G N such that n does not appear as a first component of a pair from {5j+i , . . . , 5^} n (N x Z). 
Finally, we set 

Si — {n,w) £ ^ X Zi C_ Li. 

Due to the choice of n, we have Si ^ 5p(6i, s) for all p. Moreover, with Lemma 7 we get 544 (61, 5^) ^ 
54.(6,^1). Finally, size(ti) > a{i), i.e., ti ^ Ui, and Si ^ U. 

Case 2.2. size(ti) < <j{i), i.e., ti G Ui. We set Si — ti e Ui. Note that 54»(©,ti) C U, which implies 
54i(6i,5i) = 544(0, ti). Assume that 5i G 53.44(61, s). We will deduce a contradiction. Let z+1 < j <£ 
such that dei {si, Sj) < 3 • 4*. Since 5^ G U, we must have Sj <E U as well (there is no path in 61 between 
the sets U and N x Z). Moreover, with Lemma 5 we get 

. (2) 

size(5j) < size(5j) + r • 3 • 4' < a{i) + r • 3 • 4* < cr(i + 1). 

This implies that / [54^+1 (61 , Sj) is the identity. Hence, ti G 53.44 (25, tj), a contradiction. We can finally 
choose for /' the identity isomorphism on 544 (61 , 5^) — S^i (©, ti). This proves (a). 
Let us now prove (b). Let 5^ G ii U 53. 4^ (61 , s). Again, we distinguish two cases. 



Case 1. Si G S'3.4i(©i, s). This implies 

Sii{<Si,Si,'s) C S'4>+i(6i,s). 
We set ti — f{si) E 5*3.4; (0, t). We can conclude as in Case 1 for point (a) above. 

Case2. Si ^ S'3.4i(6i,s). Hence, s^ G L^. We will find t^ G Trees/i and an isomorphism /' : S4^i{&i,Si) -^ 
841. (©, ti) such that ti ^ 5*3.4; (0,1). Then, Lemma 1 implies that the mapping g — [f 1841 {&i, s)) W /' is 
an isomorphism from 5*4; (61 , Si, s) to 54; (©, ii, t), which is compatible with /. Moreover, we will show 
that if ti G Ui or Si G Ui, then /' is the identity. 

Case 2.1. Si E Ui (^ TreeSyi- We set ti — Si E Ui, which implies ^4; (0, ti) C U. Thus, 54; (Si, Si) — 
54i(0,ti). Assume that ti G 53.44(0, i). We will deduce a contradiction. Let i + 1 < j < i such that 
de {ti, ij) < 3 • 4*. Lemma 5 implies 

. (2) 

size(tj) < size(ij) + r • 3 • 4* < a{i) + r • 3 • 4* < cr(« + 1). 

This implies that / |'54i+i (61, Sj) is the identity. Hence, Si G 53.4* (©1, Sj), a contradiction. We can finally 
choose for /' the identity isomorphism on 54* (Si, Si) = 841 (©, ti). 

Case 2.2. s^ G N x Zi. Let Si — {n,ui- ■ ■ Um) with ui, . . . , Um G V^ U Wi, m G M, and ||ui • • • Um\\ + 
int(TO) > cr(i). There is exactly one 1 < j < m with Uj G Wi. Let Uj — a{vi, . . . ,Vq) with q > 1, 
a G Aq, and wi, . . . , Wg G T^. Define the string 

W = Ul- ■■ Uj-lWl • • ■ VgUj+l ■■■Um (6) 

of length m + q— \. Since m G M, we also have to + g — 1 G M. 

Cflie 2.2.7. Ai ^ 0. Then, we can choose for ti a tree with the following properties: 

- up(ti, r • 4*) = w. For this, we connect all trees ui, . . . , Um to one tree using a chain of symbols of 
rank at least 2, starting from Uj. Since m G M, this is possible by Lemma 12 (and the remark after the 
proof of Lemma 12). 

- ti ^ 53.4; (0,1) and size(ti) > cr{i). This can be enforced by adding a long enough chain of unary 
symbols to the root. 

With Lemma 7, the first point implies 841 (Si, Si) ^ 54* (0, ti). Moreover, since size(ti) > (j{i), we have 

t^ ^ V^. 

Case 2.2.2. A2 ^ 0. Note that ||w|| = ||ui • • • Um\\ - h i.e., 

||w|| + int(?7i) = ||ui • • • Um\\ + int(?Ti) — 1 > (7{i). 

Every tree in the string w has size at most r • 4\ Hence, we have ||w|| < (to, + g — 1) • r • 4*. We get 

(m + (J - 1) • r • 4* + int(TO) > a{i). 

Moreover, since int(TO,) > ^^^^^x" ^Y Lemma 11, we have m+g—l < int(m)-(p— l)+g < \nt{m)^{p—l)+p. 
We get 

(int(m) • (p - 1) +p) • r • 4* + int(TO) > a{i). 

This is equivalent to 

int(m) > "- ^ ^ 



(p - 1) • r • 4» + 1 
Plugging in the definition of <T{i) from (1) yields 

int(TO) > ^ • r • 7 • 4*. (7) 

We now define £ + 1 different trees t[, . . . , t'^^^ as follows. 

We first fix a sequence ai, . . . , a;nt(m) of symbols from A\ Aq such that that every chain, where the 
j*'' internal node is labelled with aj has exactly m leaves. By Lemma 12 such a sequence exists. In the 
following, we consider chains with int(?7i) + 1 many internal nodes such that the following hold: 



- The j* internal node (1 < j < int(m)) is labelled with aj and the maximal internal node is labelled 
with a ^ Aq (thus, such a chain has m + q — 1 leaves). 

- Every internal node belongs to {1,2}* (thus, every internal node, which is not the root, is either the 
first or the second child of its parent node). 

- All leaves in the chain are labelled with some fixed constant O £ Aq. 

This means that such a chain is uniquely determined by its maximal internal node u = max(f) G {1,2}'"*^™). 
We write t = chain(u). 

Let u,v <E {1, 2}'"'(™) such that u — xay and v — xbz with x,y,z G {1, 2}*, a, 6 G {1, 2}, a ^ b. 
Define difF(u, v) == |y| + 1 (~ \z\ + 1). Recall also the definition of the diff-value for two trees from 
Section 2.4. Then, we have 

difF(chain(u),chain(w)) >difF(u, w). (8) 

In fact, difF(chain(u), chain(ti)) > 2 • difF(u, v) holds. 

Since int(m) > ^ • r • 7 • 4' by (7), we can find £ + 1 strings wi, . . . , wg+i G {1, 2}'"'('"' such that for 
all k ^ k' we have 

d\ffiwk,Wk')>r-7-A\ (9) 

We may for instance set 

_ -|int(m)-£-r-7-4'-|(fe--l)-r-7-4*2(^-'£+l)-'--7-4' 

Let us define the chain Ck = chain(u'fe) for all 1 < fc < £ + L Hence, (8) and (9) imply 

d[ff{ck,Ck')>r-7-4' (10) 

for all k ^ k'. Moreover, every chain Ck has exactly m + q ~ 1 leaves. Finally, the tree t'j, is obtained 
from the chain Ck as follows: We replace the q children of the maximal internal node max(cfe) (which 
is labelled with a G Aq) hy vi, . . . ,Vq (in this order). All other m — 1 leaves are replaced by the trees 
ui, . . . , Uj_i, Uj+i, . . . , Urn (the Order does not matter). It follows that the string ijj \ up(i5j, r ■ 4*) is a 
permutation of the string w from (6). With Lemma 7 (and the remark after the proof of Lemma 7) this 
ensures that 

for all 1 < fc < i* + 1. Moreover, since each of the trees ui, . . . ,Uj-i,vi, . . . , Vq,Uj+i, . . . , Um G Vi has 
size at most r • 4*, the number of nodes in the subtree of ij^ rooted at a leaf of Ck may grow by at most r • 4\ 
when we replace the leaf by one of the trees ui, . . . , Uj^i,vi, . . . , Vq,Uj^i, . . . , Um- This implies 

diff(i'fc,4) > diff(cfc,CfcO - r • 4* *> r • 6 • 4\ 

provided k ^ k' . Hence, Lemma 6 implies 

de(4,4)>6-4' (11) 

for all k 7^ k' . We claim that there is at least one 1 < fc < ^ + 1 such that t'^. ^ S^.4i ((5, t). In order to 
obtain a contradiction, assume that for each 1 < k < t + 1 there exists some th {i + 1 < h < () such 
that d0{t'f,,tfi) < 3 • 4*. Since the are only £ — i < £ such trees t^, the pigeon hole principle implies that 
there exist k ^ k' and h with d&{t'^,th) < 3 • 4' and de,{t'^,,th) < 3 • 4\ Hence, d&{t'^,t'y) < 6 • 4\ 
which contradicts (11). We finally set ti ~ t'^, where k is chosen such that tj. ^ 6*3.4; (25, t). Finally, note 
that size(ti) = ||ui • • -Uml + int(TO) > cr(i) (i.e., U ^ Ui) and Si ^ U. This concludes the proof of the 
lemma. D 



Lemma 13 allows us to prove the following lemma: 
Lemma 14. Assume that 

1 <i <^, 

- s = (si+i,...,S£) e ((N X Z)\JUf^''withsj G L-j U 83.41 {sj+i, ..., si) for all j G [{ + !,£], 

- t ~ (ti+i, ■ ■ ■ ,ti) G Trees^^*, and/ 

- / : 5*44+1(61, s) — > S'4i+i(0,t) is an isomorphism such that f\S4i+i{&i, Sj) is the identity for all 
j G [i + l,i'] w/f/z ij G Ui+i or Sj G C/i+i. 

r/zen, /or every quantifier-free first-order formula ip over the signature of& and all Qo, . . . ,Qi G {V, 3} 
we have 

&i h QiXt G -Lj(s) •••Qo2;o e Lo(xi,...,Xi,s) : ■0(a;o, • • • ,2;^, s) 

© h QiXi---QoXo : ■ip{xo,...,Xi,t). 

Proof. The lemma can be shown by induction on i, starting with i = —1. For the induction base (i ~ 
— 1), note that the existence of the isomorphism / ensures that s and t satisfy the same quantifier-free 
formulas. The induction step uses Lemma 13 and the classical back-and-forth argument from the proof of 
the Ehrenfeucht-Fraisse-Theorem. D 

Setting i = lin Lemma 14, it follows & ^ cpif and only if Si ^ (pi, where (pi is from (5). 

We will simplify the sentence ipi further. Recall that quantifiers in (pi are relativized to the sets 
Li{xi+i,. . . ,xi) = Li U 53.4>(6i,Xi+i,. . . ,Xf). Note that Xi G 53.4>(6i,Xi+i, ... ,X£) means that 
\l j=i+i dei{xi,Xj) < 3 • 4* holds. By Lemma 2 we can find an equivalent first-order formula of size 
0{{e -i) -1 + (e-i) ■ \R\) < 0{P + i ■ \R\) and quantifier rank Oil) < 0{i) (we take the formula 
9{x, y) = (x = y V \/„^^ <j(x, y) V a{y, x)) in Lemma 2; note that the binary representation of 3 • 4* 
has only 2 1-bits). After replacing the constraints Xi G 5*3.4; (61 , x^+i , . . . , Xi), the resulting equivalent 
sentence has size |(/3| + 0(^'^ + ^^ • ji^l) and quantifier rank 0{£). 

It remains to eliminate constraints of the form Xi G Li = (N x Z^) U V^. In order to do this, we 
will change the labelled graph ©i to a labelled graph of the form T+ for a finite labelled graph T. In the 
following, we assume that Ai — 0; the case Ai 7^ is the simpler one. 

In order to cope with the length constraint \w\ G M in the definition of the set Zi, we define for 
< i < ^ the sets 

U' = {w eU^ \\w\ + l e ranks}, 

F/ = {w G V+ I |w7| + 1 G ranks} C U' . 

Moreover, for < i < £letus define Wl as the set of all minimal words (with respect to the factor relation 
on words) w G V*WiV* with \w\ G M (and hence w £ Z) and ||w|| + int(|w|) > cr(i). It follows that for 
such a word w we have 

||w|| + int(|w|) - (p - 1) • r • 4* - 1 < cr(i). 

Since |w| < ||w|| and int(|w|) > _i by Lemma 11, we have 

, , \w\ — 1 



p-1 



- (p- l)-r -4'- 1 < cr(i). 



This is equivalent to 



Let us set 



\w\<— + {p-lf-r-'i\ (12) 

P 

^ + (p - 1)2 . r • 4^ < r' • 20(^) + p' ■ r ■ 2^^'\ 



which is an upper bound for the right-hand side of (12). Note that 7 is exponential in our input size. Let 

U" ^{w£Z\ \w\ < 7}, 
which contains all alphabets Wl and U'. Hence, 

\U"\ < i\U\ + ly < {\A\ + l)T-(-W+P-4^) < |^|2°m(r^.p+p'^.r'^)^ (^3^ 

which is doubly exponential in our input size. 

In the following, it is important that elements from U" are viewed as single symbols. For a word 
w G U"* we can define an expanded word exp(w) e U* in the natural way (e.g. ex.p{{a){abba){b){ba)) — 
aabbabba). Note that for every word w G U'*U"U'* we have |exp(«;)| 6 M (i.e., exp(?x;) e Z). Vice 
versa, for every word w E Z there exists at least one word w' £ U'*U"U'* with exp(i(;') = w. Moreover, 
for every word w G Vl*WlVl* we have |exp(w)| G M and ||exp(i(;)|| + int(|exp(w)|) > (y{i). Vice versa, 
if w G V*WiV* n Z with ||w|| + int(|w7|) > (j{i), then there exists at least one word w' G K*^/K* with 
exp(w') = ui. This allows us to replace the constraint set Zi — {w E V*WiV*r\Z \ ||i(;||+int(|w|) > cr{i)} 
by the set V'*WlV'*. Note that for a word w E Zi there may exist several words w' G V'*WlV'* with 
exp(w') = w. This is not a problem: by taking the set N x Z^ in the structure Si, we basically take Kq 
many copies of w. 

By our lifting construction from Section 2.2, every binary relation — > (cr G Z") on Trees^ is defined 
on U" C Trees jj and hence on (N x U'*U"U'*) U U. Using this, it follows that our labelled graph 
61 = ((N X Z)UU, S, { A| a G S}) is isomorphic to the labelled graph 

62 = ((N X U'*U"U'*) U U, S, {^1 a G S}). 

The isomorphism maps the constraint set L.^ = (N x Zi) U Vi to (N x V'*WlV'*) U Vi. 

In order to get rid of the direct product with N in N x V'*WlV'* we introduce a new symbol $. We lift 
the relations — > (u G Z") from U"* to {U" U {$})* in the standard way ($ does not occur in the left-hand 
and right-hand sides of the relations — >). Then, the labelled graph 62 (and hence ©i) is isomorphic to the 
graph 

63 = {{U' U {%}YU"{U' U {$})* U U, S, {^1 n G S}). 

The isomorphism maps U identically to U and the set N x {w} (for w G U'*U"U'*) is mapped to the set 
of words from ([/' U {$})*[/"([/' U {$})* \ U , whose projection onto the subalphabet U" is w. Hence, the 
constraint set N x V'*WlV'* is mapped to the set (¥( U {$})*W/(F/ U {$})* \ U. In order to express in 
first-order logic that a word belongs to this set, we introduce another symbol #. Hence, our final alphabet 
is 

r = L/"u {$,#}. 

The size of F can be estimated as 

|r|=2 + |L/"| <Vp"<^'(-*-^+^^-^). 
Next, we define a labelled graph T = (-T, i7', {— >| a G ^'}) as follows. The set of actions is 

s' = u\jr. 

The set of transitions is defined as follows. By our lifting construction from Section 2.2, every binary 
relation — > (ct G Z") on Trees^ is defined on F ($ and # do not occur in the left-hand and right-hand sides 
of the relations — >). Moreover, for a G -T we define the relation 

Finally, using the construction from Section 2.2, we define the labelled graph 

64 = 1+. 



We will construct a sentence (f4 over the signature of S4 such that ©i |= ipi if and only if 64 |= Lp4. 
Using the edge relations — > (a G F), we can express a; <E i7* (for il C F) as 

f\ ^3y:a{x,y). 

aerxn 

Moreover, a constraint \x\f2 > k can be expressed as 



A y^ ^yj' ^ A V «(^'yo]- 



This allows us to express e.g. \x\q = fc or x G Q. Hence, a constraint .t e L^ = (N x Zi) U 1^ in </?i can 
be replaced by the formula 

{x G iyl U Wi' U {$})* A \x\w'^ =\ h x(^U) \J x'^Vi. 

The size of the resulting sentence </?4 can be bounded by Yp\ + ©(i?'^ +i?^ • jZ'l +£• |I^|) and its quantifier 
rank is still 0{t). 

Recall that we want to check, whether 25 |= </? holds. The latter is equivalent to 64 |= <y54. By Theo- 
rem 10, this can be decided on an alternating Turing machine in time 

o(|r|OW.|^4|)<poiy(|r|OW + |^| + |i;|) 

using 0(€) < 0{\tp\) many alternations. Recall that \F\ < |Ap ('" 'P^p '' \ Hence, we can bound the 
running time by poly(|^p ('" 'P^p '" ^ + |</?| + \S\), which is doubly exponential in the input size. This 
concludes the proof of Theorem 9. 

3.2 Proof of Theorem 10 

Let us fix a finite labelled graph & = (V, S, {— >| a G S}) and let n = \V\. W.l.o.g. we can assume that 
n > 2. Moreover, we can assume that S = V x V and that the edge from a G T^ to 6 G T^ is labelled with 
(a, 6). 

Recall that for a word u E A* over a finite alphabet A and a G A, \u\a denotes the number of 
occurrences of a in u. For d > 1 and u,v <E A*,we write u =^1 v if for all a G A the following holds: 

- \u\a = |f|aOr 

- (\u\a > dand \va\ > d) 

Note that =d is an equivalence relation and that u =d+i v implies u =d v. 

Let A and B be finite alphabets. For two words u = 0102 • • • a*; G A* and v — 6162 •••&*; G S* of the 
same length k we define the convolution u® v — (ai, 6i)(a2, 62) • • • (ofe, hk) G (^ x B^ . 

Lemma 15. Let a G N, u, u G -T* (where F is a finite alphabet), u' G V* with \u\ — \u'\, u =q.„ v, and 
\v\> a ■ n ■ \F\. Then there exists v' G V* with \v\ — \v'\ and u (E" u' =„ v (8) w'. 

Proof. Let a G F and b <E V. Consider the values nia.b — \u ® w'|(a,b) ^nd Ua = \v\a- Finding a word 
v' G V* such that \v'\ — \v\ and u® u' =a v ® v' is equivalent to finding numbers na,b (which will be 

\v eg) w'|(a,6)) such that 

- J2bev "^a.b = "a for all fl G -T and 

- Tna.b = na.b or (ma.b > a and Ua b ^ (x) for all a G -T, 6 G V. 



V implies 



/ "T-a,b ^ na or ( > JTia b > Q! • n and Ua > a ■ n) 
bev bev 



for all a E F. Also recall that \V\ — n. We choose the numbers Ua^b as follows, where a G -T: 



- ^^ J2bev ^a,b == na, then we set Ua.b = ma.t for all b eV. 

- If X^hGF 'ma.b > a ■ n and Ua > a ■ n, then (since \V\ — n) there must be at least one b ^ V with 
Tia.b > «• We first set Ua.t — rna.b for all 6 G y with niafi < ct. For all remaining b ^ V (which 
satisfy nia.b > en) we set n^ f, to some value > a such that the total sum X^fcev "^a-fc becomes n^. Since 
ria > a ■ n this is possible. D 

For all k,£ E N we define an equivalence relation =k^e on the set (V*)'' of fc-tuples of words over V as 
follows: Let (ui, . . . , Ufc), (wi, . . . , Vk) £ (F*)*^. Then (mi, . . . , Uk) =k.i {vi, ■ ■ ■ , Vk) if and only if the 
following conditions hold: 



(a) For all 1 < i,j < k, \ui\ = \uj\ if and only if jwil = \vj 

(b) For alll < i < k, Ui = Vi or \ui\ > 72*^+^+^ and \vi\ > 



k+i+l 



(c) For all 1 < i < fc the following holds: Let 1 < ii < 12 ■ ■ ■ < im < khe exactly those indices such that 

\ui\ = \uii\ = ■ ■ ■ = |ui^|. Hence, \vi\ = |wij == • • • = \vi^\ due to (a). ThenM^j (gJMi^®- • ■®Ui^ =„ 
Vi^ ® Vi^ ® ■ ■ ■ ® Vi^, where a = n^+^. 

Lemma 16. Let k > 0, £ > 0, (ui, . . . , Uk) ^ki (wi, . . . , Vk), and let Uk+i G V*. Then there exists 
Vk+i e V* such that \vk+i\ < n*=+'^+i + kand{ui, . . . ,Uk,Uk+i) =k+ij-i {vi,. . . ,Vk,Vk+i)- 

Proof. Assume that (ui, . . . , Uk) =k,i (wi, • ■ • , Vk) and let Uk+i G V* . We distinguish several cases: 

Case 1. |ufe+i| 7^ \ui\ for all 1 < z < A:. 

Case 1.1. \uk+i\ < n''+^+-'-. Then, we must have |ufe+i| 7^ |fi| for all I < i < k (if \vi\ = \uk+i\ < 
^k+t+i^ then we must have Ui — Vi by (b) and hence |ufe+i| — \ui\). We set Vk+i = Uk+i- 

Case 1.2. \uk+i\ > n''+'^+^. Choose a number A with n''+'^+^ < A < n''+'^+^ + k and |i;j| 7^ A for 
alll < i < k. We will find a word Vk+i such that |fA;+i| = A and Uk+i ^a Vk+i for a = r/ . Since, 
|ufe+i| > n'^+^+^, there exists a symbol a G ^ such that |MA;+i|a > 72*^+^ > n^ = a. If A > |ufe+i|, 
then we simply increase the number of occurrences of a in Uk+i until a word of length A is reached. If 
A < |ufe+i|, then |ufc+i| > n^''"^. Hence, there even exists a eV with |ufc_|_i|a > n^. We remove one of 
the occurrences of a in Uk+i ■ We can repeat this step until a word of length A is reached. 

Case 2. |ufe+i| — \ui\ for some 1 < i < k. Let 1 < ii < 12 ■ ■ ■ < im < k ht exactly those indices such 

that \uk+i I = |uii I = • • • = l^i^ |. Let u — m^ ® Ui^ ® • ■ ■ ® Ui^ and v — Vi^ ® Vi^ ® ■ ■ • ® Wi„ . Point 
(c) implies u =„«+i v. 

Case 2.1. \uk+i\ < 71*^"''^+-'-. Hence, we have \ui\ < n''+^+-'-. This implies \ui\ — \vi\ < n^+'^+'^ by (b). We 
set Uk+i = Vk+i- Note that Vi-^ = u^j^ , . . . , Vi^ ~ Ui^ by (b). This implies 

Ui^®Ui^® ■ ■ ■ ® Ui^ ® Uk+1 =a Vi^ ^ Vt2 ^ ■ ■ ■ ^ Vi^ ® Vk+1 

for all a. 

Case 2.2. |Mfe+i| > n'^+^+^. Hence, we have |u| = \ui\ > n''+^+-'-. This implies |f| = \vi\ > n^+^+^ by 
(b). We have to choose a word Vk+i with |f/£+i| = \vi\ and u ® Uk+i =a v (E) Vk+i for a = n^. This is 
possible by Lemma 15: Note that a ■ n — r/^^ and thus u =a-n v. In order to apply Lemma 15 we set in 
addition u' — Uk+i, v' = Vk+i, and F — V"^. This implies 

\v\ > n'^+^+i =n^-n-n''>n'^ ■n-n"'^a-n- \r\. 

Hence, Lemma 15 can be applied indeed. D 

Lemma 17. If{ui, . . . ,Uk) ^k.o (^i, . . . , ffe), then (ui, . . . ,Uk) and (vi, . . . ,Vk) satisfy the same quantifier- 
free formulas in the graph 0+. 

Proof. By symmetry, it suffices to prove the following two points: 

(a) If Ui — Uj then also Vi = Vj . 

(b) If Ui — '- — > Uj for some {a,b) E V x V then also Vi — '- — > Vj. 



Let us first prove (a). W.l.o.g. assume that i = 1 and j ~ 2. Let 2 < zi < «2 < • • • < hn be those 
indices such that I ui I = \u2\ = |uij = •■• = \ui^\. Since {m, ... ,Uk) =k,Q (wi, . . . , Ufc), we get 
\vi\ = \v2\ — \vi^ ! = ■•■ = I^i™ I and wi (8) ^2 (8) Mil (8) • • • ® ""i™ =Q wi (8 W2 8) Wii (8 • • • 8) Wi„ for 
a = n > 2. Since ui = U2, all symbols that occur in mi 8) U2 C8 Wii 8) • • • 8) Ui^ are of the form (a, a, • • • ) 
for some a ^V . Hence, the same has to hold for wi 8i U2 <8 f ii (8 ■ • • 8) fi„ • But this means that vi = V2. 
For point (b), assume first that a = b. Thus, m = Uj and \ui\a > 0. By point (a), we already know 
that Vi = Vj. If z = j, then we can w.l.o.g. assume that i — j — 1. Let I < ii < i2 < ■ ■ ■ < im 
be those indices such that I ui I = |uij — ■■■ — \ui^\. Since {ui, ... ,Uk) ^k.o {vi, . . . ,Vk), we get 
K'l I = l^ii ! = ■■' — I^im I and Ml 8) Uii 8) • • • 8) Ui^ =£j wi 8) Wii 8) • • • 8) Vi^ for a = n > 2. Since 
Ml la > 0, the word mi 8) Wi^ 8) • • • 8) m^^ contains at least one occurrence of a symbol of the form (a, . . .). 

Hence, the same holds for vi (S> Vi^ 8> • • • 8) Vi^. But this means that vi — '■ — > vi. If i ^ j, then we can 
argue similarly. 

Finally, let us assume that a ^ h. We must have i ^ j. W.l.o.g. assume that i — 1 and j ~ 2. Let us 

choose the indices 2 < ii < ^2 < • • • < ?m as for the proof of point (a) above. Since mi — '- — > M2, the 
following holds for the word m = mi 8) M2 8) m^^ 8) • • • 8) Mi„ : u contains exactly one occurrence of a symbol 
of the form (a, 6, . . .) and all other symbols in u are of the form (c, c, . . .) for c G V". Again, the same has 

to be true for wi 8) M2 8) Wii 8) • • • 8) Wi„ • Hence, vi — ' — > V2. □ 

We can now prove Theorem 10. Let (p = QoXq ■ ■ ■ Qexe '. V'C^^O: • • • : xg) be a first-order formula over the 
signature of 0+, where Qq, . . . , Q„ S {V, 3} and ip is quantifier-free. For < i < £ let Li = {w ^ 
V^ I \w\ <r/ + i). By the method of Ferrante and Rackoff from Section 2.3 (with -ff (fc, t) = n^^^ + k) 
Lemma 16 and 17 imply that ©+ H 'P if and only if 

0+ h QiXQ e Lo- ■■ Qexi G Le : ■0(a;o, . . . , xe). 

This can be decided on an alternating Turing machine in time 0{n^ • |<p|) with £ alternations. 

4 An ATI ME(22'""'*"', poly (n)) lower bound 

In this section, we will prove that there exists a fixed GTRS such that the corresponding ground tree 
rewrite graph has an ATIME(2^ , poly(n))-complete first-order theory. This will be achieved using 
a suitable tiling problem. Tiling problems turned out to be an important tool for proving hardness and 
undecidability results in logic, see e.g. [3]. In a first step we will prove hardness for 2NEXP (doubly 
exponential non-deterministic time) in Section 4.2. In Section 4.3, we will finally push the lower bound to 
ATIME(22"'<"',poly(n)). 

4.1 Tiling systems 

A tiling system is a tuple S ~ {0, BH, V), where 6* is a finite set of tile types, IH C x is a horizontal 
matching relation, and V C 6* x 6* is a vertical matching relation. A mapping a : [0, fe — 1] x [0, fc — 1] — > 6* 
(where fc > 0) is a k-solution for S if for all (x, y) G [0, fc — 1] x [0, fc — 1] the following holds: 

- if a; < fc - 1, a{x, y) = 6, and (j{x + l,y)^ 9', then {9, 0') G H, and 

- if 2/ < fc - 1, a{x, y) = 9, and (t(x, y + 1) = 9', then {9, 9') G V. 

Let Solfe(5) denote the set of all fc-solutions for S. Let w ~ wq- ■ ■ Wn-i G 6*" be a word and let k > n. 
With Solfe(5', w) we denote the set of all a G Solfc(S') such that <t{x, 0) = Wx for all x G [0, n — 1]. For a 
tiling system S we define its (2'^ x 2^ ) tiling problem as follows: 

(2^" X 2^" ) TILING PROBLEM FOR TILING SYSTEM S* = (6*, H, V) 

INPUT: A word w G 6»". 

QUESTION: Does S0I22" (5, w) ^ 9 hold? 

The following proposition is folklore, see also [23]. 

Proposition 18. [23] There is some fixed tiling system Sq whose {2^ x2^ ) tiling problem is 2NEXP-hard 
under logspace reductions. 



4.2 Hardness for 2 N EX P 

Let us fix the tiling system 5*0 = (6*0, Hq, Vq) of Proposition 18 whose tiling problem is hard for 2NEXP. 
We now define a fixed GTRS Uq = (A, S, R) and prove that the first-order theory of ©(Teo) is 2NEXP- 
hard under logspace reductions. We define 

Ao = {<^,l,lt,l|,0,0t,0a, 

Ai = Oo, 

A2 = {•}, and 

U — {^, r, /i,M, m-f, mj, } U 6*0 U Aq. 



The set of rewrite rules R is given as follows: 

(1) X ^ X for each X e Aq, 

(2) X I — h X^ for each X e {1,0} (this will correspond to marking a leaf), 

(3) X^ I — > X^ for each X e {1,0} (this will correspond to selecting a leaf), 

(4) Xt hA^ 9 for each X e {1, 0}, 

(5) .(9,9)Hi^9, 

(6) e{X^) H-^ e{X^) for all 9 e Oq, X e {1, 0}, 

(7) .(9, Xf ) H^ X| for each X e {1, 0}, and 

(8) •(Xt,9) H^Xj for each Xe {1,0}. 

For the rest of this section we fix ©0 = ©(T^-o)- Let us fix an input w ^ Oq- ■ ■ 0„_i e 6*" of the (2^" x2^") 
tiling problem for Sq. Our goal is to compute in logspace from w a first-order sentence ip over U such that 

Sol22"(5o,u;)^0 ^^ ©oh'/'- 

For each subset _r C Z", we define — > = U-yer — ^- ^^^ following lemma follows immediately from 
Lemma 2 (take the formula 9{x, y) = V-yer t(^' 2/))- 

Lemma 19. Given a subset of actions F <Z S and j G [0, 2"+-'^] (in binary) one can compute in logspace 
a first-order formula r^{x,y) such that for all t,t' G Treesyi we have ©0 |= r^(t,t') if and only if 

t {^y t' in ©0. 

In case F ~ {7} is a singleton, we also write 7-' [x, y) for the formula F'-' (x, y) of Lemma 19. Moreover, 
for subsets Fi,. . . ,Fk Q S and ji, . . . ,jk G N, we write [F(^ ■ ■ ■ F-jJ' ] (x, y) for the formula 

k 
3x0,..., Xk : (xq = X A Xk = y /\ f\F^'{xi^i,Xi)). 

i=l 

A tree t G Trees^ is a tile tree '\ft — 9{t') for some t' G Trees^ such that the following holds: 

- ee0n, 

- The label of every leaf of t' is from {0,1}. 

- The distance of every leaf of t' to the root of i' is n + 1. 

- Every internal node of t' is labeled with •. 

Example 20. This is a tile tree in case n + 1 = 3: 




Let us fix a tile tree t. Note that t has precisely 2"+^ =2-2" leaves. Hence, there is a one-to-one 
correspondence between [0, 2"+^ — 1] and leaves of t by means of their lexicographic order in t. For 
each leaf A let lex(A) G [0, 2"+^ — 1] be the position of A among all leaves w.r.t. the lexicographic order 
(starting with 0). The intention is that t represents the 6'-labeled grid element (M, N) S [0, 2^ — 1]^, where 
each leaf A that is a left (resp. right) child represents the [ ^"^ -^ J '*' least significant bit of the 2"-bit binary 
presentation of M (resp. of N): In case A is a left child, then t(A) = (resp. t(A) = 1) if and only if the 
L"2^-l* least significant bit of M equals (resp. 1) and analogously if A is a right child this corresponds 
toN. 

We say a leaf A of a tree t is marked if t(A) = X| for some X E {0, 1}. We say a leaf A of a tree t is 
selected if i(A) — X^ for some X G {0, 1}. A marked tile tree is a tree that can be obtained from a tile 
tree t by marking every leaf of t. For the rest of this section, let D — 2"+^ — (n + 2). 

Lemma 21. One can compute in logspace a first-order formula marked(x) such that for every tree t G 
Trees^\{0 J Q} with precisely 2"^^ marked leaves we have: ©o |= marked(t) if and only if the marked 
leaves oft are the leaves of some (unique) marked tile subtree oft. 

Proof. The idea is to express the following: Whenever we select any of the 2"+^ marked leaves, we can 
execute from the resulting tree some sequence from the language h? ^^u^{i, rj^+^Qo- Let us explain 
the intuition behind this. Assume we have selected exactly one of the 2"+^ marked leaves of i, and let t' be 
the resulting tree. First, note that after executing the sequence h? ~^ from t' , we have replaced each of 
the marked leaves of i' with the symbol '^, reaching some tree t" , say. Second, when executing u^ from t" 
we have reached, in case t contained a marked tile subtree, some tree t'" that has a chain of the following 
form as a subtree, where X G {0,1} and 6 <E 0o'- 



9 

Finally, from t'" we can now "shrink" this subtree to the tree 0{Xf) by executing some sequence from 
{£, r}"+^ followed by executing 9. Formally, we define marked(x) as follows: 



D 



A grid tree is a tree t for which every leaf is inside a subtree of t that is a tile tree. 



Lemma 22. One can compute in logspace a first-order formula grid(a;) such that for all t G TreesA we 
have ©0 h= gi'id(i) if and only ift is a grid tree. 

Proof. The formula grid will be a conjunction of the following two statements: (i) every leaf is either 
labeled with or 1, (ii) for each leaf of t that we can mark via the action m-^, we can mark 2"+^ — 
1 further leaves reaching some tree t' with &o H rnarked(t'). Formally, grid(x) is the conjunction of 

Aaeyio\{o,i} ^a{x,x) which realizes (i) and 

Vy (m^{x, y) -^ 3z{rni ^^{y, z) A marked(z)) J , 
which realizes (ii). D 



A marked grid tree is a tree that can be obtained from a grid tree t by replacing exactly one tile subtree 
of t by some marked tile tree. A selected grid tree is a tree that can be obtained from some marked grid tree 
t by selecting precisely one marked leaf of t. 

Lemma 23. One can compute in logspace for each i G [1, n + 1] a first-order formula biti(x) such that 
for every selected grid tree t with selected leaf X we have that the i''' least significant bit o/lex(A) is 1 ;/ 
and only if&Q \= biti{t). 

Proof We define bit, (x) = 3y:[h'^^^^-'^u^{l,rY-'^r]{x,y). U 

Lemma 24. One can compute for each o g {<, —^ n logspace a first-order formula ipo{x, y) such that for 
every two selected grid trees ti and ^2 with selected leaves Ai and X2 we have ©0 H Vo {ti , ^2) if and only 
if\ex{Xi) o lex(A2). 

Proof. We only treat the case when o equals <; its definition should be self-explanatory: 

V K-bit,(a;)Abit,(y))A /\ (bit,(x) O bit,(y)) 

iG[l,ri+l] ^ l<i<i 

D 

Recall that the marked tile subtree of a marked grid tree t represents a ^-labeled grid element (M, N) e 

[0, 2^" - 1]2 for some e Oq- Therefore, let us define M{t) = M, N{t) = N, and 6>o(t) = 0. 

Lemma 25. One can compute in logspace first-order formulas ipg{x), ipi^M{x,x'), (pi^N{x,x'), where 
<E 00 and i G {0, 1} such that for all marked grid trees t and t' the following holds: 

(1) ©0 h ^9{t) f and only fOa{t) = 0, 

(2) ©0 h V'»,M(i,i') if and only if M{t) +i = M{t'), and 

(3) ©0 h iPi,N{t,t') if and only if N{t) +i = N{t'). 

Proof. For point (1) we define ipe{x) as follows: 

3y : [m^h^^^'-\° {i,r}''+^e]{x,y) 

For the remaining points (2) and (3), we only give the formula ipi^M{x, x'), i.e. we wish to express that for 
any two marked grid trees t and t' we have ©0 |= (pi^M{t, t') if and only if M{t) + 1 = M{t'). Let us 
fix two marked grid trees t and t' . Assume we have selected among the 2"+^ marked leaves of t some leaf 
A. Recall that A represents one of the 2" bit positions of M{t) if and only if A is a left child, otherwise it 
would represent a bit position of N{t). Hence we will only be interested in leaves of t and t' which are left 
children. For this sake, let us express that the selected leaf of a selected grid tree z is a left child via the 
formula left(z): 

left(z) ^ 3z',z"ihiz,z')A£iz',z")) 

Our formula (Pi,m{x, y) is defined as follows: 

3x',2/'(mj(x,x') Amx{y,y') Aip={x',y') A(D|(x',x') Alf(y',y') A left(x') A V'l A -02) • 

Thus, we select a postion p G [0, 2" — 1] that is set to (resp. 1) in the binary representation of M(i) (resp. 
M{t')). The formula ipi{x, y, x' , y') is defined as 



Vzi {mx{x,z) A (p<(z,x') A left(z)) — !> lx{z,z 
"iz i (m|(j/, z) A ^<(z, y') A left(z)) ^ (Dt(z, z)\ 



It expresses that each bit at some position that is smaller than p is set to 1 (resp. 0) in the binary represen- 
tation of M{t) (resp. M{t')). The formula 1/12 expresses that the binary representations of M{t) and M{t') 
agree on each position that is bigger than p. Formally, ip2(x, y, x' , y') is defined as 

\/u,vl (?7if (x, u) A ■mx{y,v) A ip^{u,v) A ip^{x' ,u) A left(u)) -^ (1|(m,u) ^ l|(w, w)) j . 

D 



We define the formula sol(a;) as the conjunction of the following formulas, where mark(zi, 22) is an ab- 
breviation for m? (zi, Z2) A marked (22): 

- a; is a grid tree: 

grid(x) 

- Whenever we mark two tile subtrees of x that represent the same grid element, their ©-labels agree: 

Vy, z ( (mark(x, y) A mark(a;, z) A (pa,M{y, z) A (po,N{y, z)) -^ /\ {(peiy) o (feiz)) 
^ eeOo 

- Whenever we mark a tile subtree of x that corresponds to the grid element (M, N) and M < 2^ — 1 
there exists some tile subtree of x that corresponds to the grid element (M + 1, N) and the horizontal 
matching relation is satisfied: 

Vyl (mark(a;,y) A 3z(m|(y, z) A <D^{z,z) A left(z))) -^ 

3y'{mark{x,y') Aipi^Miy,y') /\'PaMiy,y') /\ V ('^e(y) A (^e'(y'))) 

(e,e')eiHo 

- Analogously to the previous formula, we can express that whenever we mark a tile subtree of x that 
corresponds to the grid element (M, N) and N < 2^ — 1 there exists some tile subtree of x that 
corresponds to the grid element (M, N + I) and the vertical matching relation is satisfied. 

Finally we can construct a formula (pw(x) that guarantees that grid element {j, 0) is labeled by 9j (recall 
that w ~ 9o ■ ■ ■ dn-i) for each j G [0, n — 1]: 

32/o,---,2/n-i[ /\ {mark{x,yj) Aipe^iyj)) A yz{m^{yo,z) ^(Dx{z,z)) A 
^je[o,n-i] 

/\ i(pi,Miyj-i,y]) A (/3o,Ar(yj-i,%)) 

je[l,n-l] 

Our final formula (^ is defined as 93 ~ Ela;(sol(x) A (/3^(x)). It follows by construction that S0I22" (SojUi) ^ 
if and only if 0o H f- 

Theorem 26. The first-order theory of&Q is 2NEXP-hard under logspace reductions. 

4.3 Pushing hardness to ATIME(2^''°'^ " , poly(n)) 

Let us fix a tihng system S ~ {0, DH, V). Given a, a' G Solfc(S') we say a' extends a vertically if cr'(x, 0) — 
a{x, fc — 1) for each a; G [0,k — 1]. Let Solfe(S', <t) be the set of all <t' G Solfe(S') such that a' extends cr 
vertically. The standard encoding of Turing machine computations into tilings shows that there is a fixed 
tiling system Si ~ {0i,t 
under logspace reductions. 



tiling system Si ~ {0i, DHi, Vi) such that the following problem is hard for ATIME(2^ , poly(n)) 



Linearly alternating (2^" x 2^") tiling problem (for 6*1) 

INPUT: A word w = 6061 ■ ■ ■ 9n-i € 0"^, where n is odd. 

QUESTION: Does Bcti G S0I22" (5i, w;)Vcr2 G S0I22" (S*!, cri) • ■ • 3(t„ G S0I22" (5i,ct„_i) : true hold? 

Let 01 be the fixed GTRS graph that is obtained from ©0 of Section 4.2 when we replace the tiling system 
^oby^i. 

Corollary 27. The first-order theory of&i is hard for ATIME(2^ , poly(n)) under logspace reduc- 
tions. 



Proof. We recycle the proof presented in Section 4.2. We adapt the formulas constructed in Section 4.2 to 
the fixed tiling system Si (instead of 5*0). Recall we can compute in logspace a formula sol(x) such that 
for every tree t we have that 0i |= sol(f) if and only if t corresponds to a 2^ -solution for ^i. It is an easy 
exercise to construct in logspace a formula ext such that for any two trees t and t' each satisfying sol we 
have ©1 1= ext(i, t') if and only if the solution corresponding to t' extends that of t vertically. We obtain 
that a word w (with n = |w| odd) is a positive instance of the linearly alternating (2^ x 2^ ) tiling problem 
if and only if 0i is a model of the sentence 



3x1 ( sol(a:;) A (/3^(a:;) A Va;2 ( (soI(.T2) A ext(.Ti, 2:2)) -> ••■ 3x„ (sol(a;„) A ext(a;„_i, x„)) 

D 

We should remark that hardness for ATIME(2^''°*^ " , poly(n)) can be also proved using the method of 
Compton and Henson [9] (monadic interpretation of addition on large numbers). The use of tilings has the 
advantage of giving an almost generic reduction. On the other hand, the method of [9] yields completeness 
under the slightly stronger log-lin reductions. 

5 The first-order theory with regular unary predicates 

For a GTRS TZ = {A, S, R) and a set of trees L C Trees^i, we denote with ((&{TZ), L) the structure that 
results from the labelled graph ©(7?.) by adding the set L as an additional unary predicate. Note that if L 
is a regular set of trees, then ((&{TZ), L) is a tree automatic structure, and hence has a decidable first-order 
theory. 

Theorem 28. There exists a fixed GTRS TZ2 = {A, E, R) and a fixed regular tree language L C Trees^i 
such that the first-order theory o/(0(7?,2), L) is non-elementary. 

PROOF (sketch). The proof idea is an adaption of the proof of Theorem 2 in [14] and is hence only 
shortly sketched. We reduce from the satisfiability problem for first-order logic over binary words. Binary 
words are considered as structures over the signature (Pg, Pi,'^), where Pq and Pi are unary relations 
(representing those positions, where the letter is and 1, respectively), and where < is the natural order 
relation on positions. The idea is that a tree t G Trees^i (where A2 = {•} and ^0 — {0, 1}) corresponds 
to the unique word over {0,1} that one obtains by simply reading the yield string (the sequence of node 
labels when traversing the leaves in lexicographic order) of t. Let yield (i) denote the yield string of t. 

We translate a given first-order sentence (p over the signature (Pqj A , <) into a first-order formula t/)( a;) 
over the signature of (0(7?,2), L) such that for every tree t G Trees^i we have: yield (t) \^ (p if and only 
if (6(7?,2), L) \= ipit). Assume that xi, . . . ,Xn are the variables that occur in tp. Bounding a variable Xi 
(1 < i < n) of (pto a certain position in the word yield (t) is simulated by labelling the corresponding leaf 
of the tree i by a chain of unary symbols of length i. In order to keep the GTRS 7?^2 fixed, this chain has to 
be built up in i rewrite steps that are controlled by the formula ip{x). In order to verify an atomic predicate 
Xi < Xj in the tree t one has to check, whether the i-labelled node of t is lexicographically smaller than 
the j-labelled node. To do this using a fixed GTRS, one first replaces the chain of length i (resp., j) that 
identifies the position to which Xi (resp., Xj) is bound by a special constant a (resp. b). Again, this process 
has to be controlled by the formula tp{x). Finally, we can check Xi < Xj using the regular set of trees that 
contain a unique a-labelled leaf and a unique 6-labelled leaf, and the a-labelled leaf is lexicographically 
smaller than the 6-labelled leaf. This regular set will be the set L in the theorem. D 
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